Indexing and mapping of proteins using a modified nonlinear Sammon projection
نویسندگان
چکیده
A modified Sammon algorithm was developed to display a relationship between proteins based on their amino acid composition. In the first stage of the method, a 19-dimensional compositional space of representative proteins was mapped into a 2-dimensional space (2-D) using the original Sammon projection creating a contour map. In the second stage, this contour map was used as a reference for new proteins projected into 2-D. Data analysis showed that proteins belonging to the same structural classes formed characteristic and distinct clusters, which could be potentially useful in the prediction of protein structural classes. However, we observed significant overlapping of the clusters which may explain the limited success of previous protein folding prediction based solely on amino acid composition. Regardless, the modified Sammon projections can generate a unique index for each individually projected protein related to its amino acid composition, which may be a useful tool in the exploratory classification of proteins.
منابع مشابه
INDEXING AND MAPPING OF PROTEINS USING A MODIFIED NONLINEAR SAMMON PROJECTION Nonlinear Sammon Projection of Compositional Space of Proteins can Predict Protein Folding Classes
A modified Sammon's algorithm was applied to display a relationship between proteins based on their amino acid composition. In the first stage of the method the 19-dimensional compositional space of representative proteins was mapped into 2-dimensinal space using the original Sammon projection to create a contour map. In the second stage, the contour map was used as a reference for newly projec...
متن کاملA nonlinear projection method based on Kohonen's topology preserving maps
A nonlinear projection method is presented to visualize high-dimensional data as a 2D image. The proposed method is based on the topology preserving mapping algorithm of Kohonen. The topology preserving mapping algorithm is used to train a 2D network structure. Then the interpoint distances in the feature space between the units in the network are graphically displayed to show the underlying st...
متن کاملNonlinear Multidimensional Data Projection and Visualisation
Multidimensional data projection and visualisation are becoming increasingly important and have found wide applications in many fields such as decision support, bioinformatics and web/document organisation. Various methods and algorithms have been proposed as either nonparametric or semiparametric approaches. This paper provides an overview of the subject and reviews some recent developments. R...
متن کاملRetraining the Neural Network for Data Visualization
In this paper, we discuss the visuaHzation of multidimensional data. A well-known procedure for mapping data from a high-dimensional space onto a lower-dimensional one is Sammon's mapping. The algorithm is oriented to minimize the projection error. We investigate an unsupervised backpropagation algorithm to train a multilayer feed-forward neural network (SAMANN) to perform the Sammon's nonlinea...
متن کاملOptimal decisions in combining the SOM with nonlinear projection methods
Visual data mining is an efficient way to involve human in search for a optimal decision. This paper focuses on the optimization of the visual presentation of multidimensional data. A variety of methods for projection of multidimensional data on the plane have been developed. At present, a tendency of their joint use is observed. In this paper, two consequent combinations of the self-organizing...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of Computational Chemistry
دوره 20 شماره
صفحات -
تاریخ انتشار 1999